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US -CL- CURRENT: 717 / 158 ; 711 / 125 , 711 / 134 
ABSTRACT: 

A system and method are provided that allows the results of an instruction trace 
mechanism to globally restructure the instructions. The process reorders the 
instructions in an executable program, using an actual execution profile (or 
instruction address trace) for a selected workload, to improve utilization of the 
existing hardware architecture. The reordering of instructions is implemented at a 
global level (i.e., independent of procedure or other structural boundaries which 
maximizes speedup) running on various hardware platforms and adds the ability to 
preserve correctness and debuggability for reordered executables . An unconditional 
branch instruction is added at the memory locations where reordered instructions 
previously were stored. When a dynamic branch occurs, the program will attempt to 
access the instruction at the original address and the unconditional branch directs 
the program to the reordered location of the instruction and program integrity is 
maintained. 
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US - CL - CURRENT : 717 / 129 
ABSTRACT : 

A microprocessor performance monitor and instruction address break point facility- 
are interconnected to provide finer granularity and performance monitoring. The 
microprocessor is initialized to collect processor statistics preselected prior to 
performance monitoring. Application start and stop instruction breakpoint addresses 
are preselected from a software program bounding instructions for which such 
statistics are desired. An exception handler is installed for instruction address 
breakpoints (IAB) , enabling and disabling the performance monitor and stop 
addresses, respectively. The IAB register is then initalized to the start address, 
and the statistics counters are cleared. Upon starting the application, when the 
application start address instruction is executed, the breakpoint handler obtains 
control and enables the performance monitor counters, which count the desired 
statistics after returning from the breakpoint handler. Before returning, the 
handler sets the IAB register to the stop address. When the application stop address 
is encountered, the breakpoint handler disables the performance monitor counters, 
and rearms the start address in the IAB register. The performance monitor counters 
are then read to determine the desired statistics for the specific sequence of code 
within the boundaries of the start and stop addresses in the application. 

27 Claims, 3 Drawing figures 
Exemplary Claim Number: 1 
Number of Drawing Sheets: 3 
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US -CL- CURRENT: 717 / 158 
ABSTRACT : 

The present invention is an system and method for optimizing a program, having 
qualified elements, at the source level. The method includes the steps of 
instrumenting each path of the qualified elements to create an instrumented program, 
executing the instrumented program to produce instrumentation information for each 
of the paths, analyzing the instrumentation information, and in response to the 
analyzing step, restructuring the program to create an optimize program. 

1 Claims, 6 Drawing figures 
Exemplary Claim Number: 1 
Number of Drawing Sheets: 5 
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TITLE: Method for optimizing computer code to provide more efficient execution on 
computers having cache memories 

DATE- ISSUED: May 18, 1993 



INVENTOR- INFORMATION : 
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Pettis ; Karl W. 
Hansen; Robert C. 



CITY 

San Jose 
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STATE 
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COUNTRY 



US -CL- CURRENT: 717/153; 714/38 
ABSTRACT : 

The method uses statistical information obtained by running the computer code with 
test data to determine a new ordering for the code blocks. The new order places code 
blocks that are often executed after one another close to one another in the 
computer's memory. The method first generates chains of basic blocks, and then 
merges the chains. Finally, basic blocks that were not executed by the test data 
that was used to generate the statistical information are moved to a distant 
location to allow the blocks that were used to be more closely grouped together. 

6 Claims, 7 Drawing figures 
Exemplary Claim Number: 1,5 
Number of Drawing Sheets: 5 
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